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<210> 1 
<211> 3856 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<223> INDEHISCENT1 (IND1) genomic 

<220> 

<221> CDS 

<222> (2765) . . (3361) 

<223> IND1 



<400> 1 
ctctagacca 


tctactatcc 


ggttgttgac 


ccttaaagct 


tttgaagact 


actagaataa 


60 


tgcaaatacc 


atatgtccat 


atccatcctt 


ttcttttgtt 


tgaactgaac 


attctaattt 


120 


tgtaaaagaa 


aaaaccttat 


gttaatatca 


ccgtaggcaa 


aaaaaatatc 


tcatcatatt 


180 


aaatttttat 


tataagatta 


tacattctct 


cgttgtaaga 


gttactccaa 


ttgcaagtgt 


240 


tgtattaact 


aataaaaagg 


acgaaagtag 


gaagcttata 


attaattgat 


gttgcatagt 


300 


actggtatat 


tgttgatgaa 


tataacaagt 


atgaacatta 


atgcatgaaa 


cggggtattt 


360 


tgtcttgaac 


tcattaaagg 


caatgtgaaa 


agaagatgtg 


aggtctcatt 


ttgaaaattt 


420 


atcttctagc 


tttgtcgatt 


ttaaatctat 


gaaatgaacg 


caacatatag 


aaatttcatg 


480 


tggacaacga 


catttagacg 


gtatcttaat 


tagaccgatt 


aattagtaat 


atacttatat 


540 


atataattag 


tggtgattat 


aagtttactt 


atccacttga 


gaatttaaac 


aatgggcaat 


600 


accttaatgt 


cgaaagaagc 


cgtccccact 


tcgtgtaatg 


agttatgggg 


gagagatcct 


660 


gttaaatcgt 


caaataaaac 


aacttaagaa 


ctagaaattg 


acaccaaaaa 


tcataaagag 


720 



1 



i 



aacgttgaag 


aagtcattta 


tcgtatccag 


ctcatatttc 


ctagctaaga 


tcaaatcaag 


780 


gccgttgaaa 


gggcttgtaa 


gaaaatgtcg 


aagaaaccgt 


ggggtttaga 


agaaagacaa 


840 


gaaatagaag 


aacaatgatg 


ttaaattgcc 


tattttggtg 


tataggagtt 


gtcaaaagag 


900 


gagagagaga 


agaaaattag 


gtcaaaataa 


tgagcactaa 


aaatggagac 


atgtgttgag 


960 


taactattac 


aagagcgact 


tatgcttcct 


tatggcaatg 


atatccaaac 


caaagtgcaa 


1020 


cgctcctttt 


ttgccctaat 


ttcgtaaagt 


ctctctcctt 


cttcgtcctt 


aggaaaaacc 


1080 


ctagaaattt 


aatcccttgt 


tcttgatctt 


gctttttgag 


taaccatgat 


tttgaccaca 


1140 


cactatttct 


tctatctttt 


gtggtctata 


ggattttgct 


ttatatgtgt 


ttcttgtatt 


1200 


gctccgtacg 


tacgtatacg 


aatttaaatg 


gttataacaa 


ggtttatata 


aactagcaca 


1260 


aatgagtcca 


tgaaatttgt 


tagcgaaaaa 


ggtagaaata 


tattgagtct 


ttaaacggca 


1320 


atatatataa 


ttttgctgca 


aaacttagct 


ttaatcatga 


tctaatgata 


ttttctttaa 


1380 


tttcctttgc 


caaattaatc 


acatgcacgg 


atttttggca 


agttatgtgt 


cgaattcttc 


1440 


cattcacaca 


acactaaact 


taattagaac 


tctaggaaat 


attttaaaat 


gacaacttta 


1500 


tcgaaaaaaa 


tttagttatg 


aaaacaattc 


cagaattaaa 


catgagctat 


ataatttaag 


1560 


ataaaatgaa 


gtaatattga 


tatgtatgta 


ataacatatc 


tgattgcggt 


aaaaaaaaac 


1620 


atatctgatt 


aaattgttca 


tgcaggccca 


tgtcactatg 


atgtcatcac 


gtttttattt 


1680 


tcacaataac 


taatatatat 


tcaaaaaaat 


agttttgtca 


gattaaattt 


tttttggtgg 


1740 


tcagctttct 


ccaacctact 


aaactagttt 


ggaatgttct 


cttctttatt 


tttctttttc 


1800 


ttgatttctt 


atgtttttta 


tttatggaat 


tttaagacgg 


attgtttagg 


tcgtttctct 


1860 


cttttcttgt 


tttctaaagt 


tacttttgta 


aactcatctc 


ctcccaatta 


gacagtcaat 


1920 


catatagtta 


tcttttaata 


tatgtctagt 


tgataaaaaa 


aatgaaaaaa 


tactggtggt 


1980 


agttctacta 


atgtttgtgt 


aaaaaatctg 


atattatgaa 


tctaatcaat 


ttctttgatc 


2040 


gtataatgtg 


ggttaaattt 


agtaattttt 


tacataaata 


agaactgtaa 


tgttgatgta 


2100 


tattggggaa 


tcagtatatt 


agcttgggta 


actatacttc 


tggaaatact 


tgaagattta 


2160 


actatttgca 


aaattataat 


ttagtcccga 


aaaatacaga 


cgacgggaca 


cgacaacata 


2220 


taagcaggtt 


tgaatcttgg 


aaaattttgt 


atacataacc 


tatataaata 


ctaatgttct 


2280 


ggttgggttc 


aaaagccttt 


tcaaaagttc 


cattttttaa 


attcaaggac 


attttacata 


2340 


ggaaataagt 


tgagtcataa 


aaaataatgg 


ttattttgta 


aggttttttt 


tttgattaaa 


2400 


acgcacatat 


taagaagtta 


gttttttttc 


actaccaaat 


atcaattaat 


ttaaaaccat 


2460 


gcaaccattc 


ataaaacaat 


actattaaag 


aatataaata 


atcacaaaat 


attaaataca 


2520 
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t 



cttaaaattt acatataaat ttacaaaaca tctaattaat tgaaacagaa aggaaaaggt 258 0 

aaaatatatc ataaaatgag acatatatcc tataaaaaaa aaatgaggca tatgaagtaa 2 64 0 

ataataagag acatgcatgt aagcattcgg ttaattaatc gagtcaaaga tatatatcag 2700 

taaatacata tgtgtatatt tctggaaaaa gaatatatat attgagaaat aagaaaagat 2760 

gaaa atg gaa aat ggt atg tat aaa aag aaa gga gtg tgc gac tct tgt 2 80 9 
Met Glu Asn Gly Met Tyr Lys Lys Lys Gly Val Cys Asp Ser Cys 
15 10 15 

gtc teg tec aaa age aga tec aac cac age ccc aaa aga age atg atg 2 857 
Val Ser Ser Lys Ser Arg Ser Asn His Ser Pro Lys Arg Ser Met Met 

20 25 30 



gag cct cag cct cac cat etc etc atg gat tgg aac aaa get aat gat 
Glu Pro Gin Pro His His Leu Leu Met Asp Trp Asn Lys Ala Asn Asp 

35 40 45 

ctt etc aca caa gaa cac gca get ttt etc aat gat cct cac cat etc 
Leu Leu Thr Gin Glu His Ala Ala Phe Leu Asn Asp Pro His His Leu 
50 55 60 

atg tta gat cca cct ccc gaa ace eta att cac ttg gac gaa gac gaa 
Met Leu Asp Pro Pro Pro Glu Thr Leu He His Leu Asp Glu Asp Glu 
65 70 75 



gee gtc atg cag ccc gta gac ate gac cct gee acg gtc cct aag ccg 
Ala Val Met Gin Pro Val Asp He Asp Pro Ala Thr Val Pro Lys Pro 

100 105 110 



2905 



2953 



3001 



gag tac gat gaa gac atg gat gcg atg aag gag atg cag tac atg ate 3 04 9 
Glu Tyr Asp Glu Asp Met Asp Ala Met Lys Glu Met Gin Tyr Met He 
80 85 90 95 



3097 



aac cgc cgt aac gta agg ata age gac gat cct cag acg gtg gtt get 314 5 
Asn Arg Arg Asn Val Arg He Ser Asp Asp Pro Gin Thr Val Val Ala 

115 120 125 

cgt egg cgt egg gaa agg ate age gag aag ate cga att etc aag agg 3193 
Arg Arg Arg Arg Glu Arg He Ser Glu Lys He Arg He Leu Lys Arg 
130 135 140 

ate gtg cct ggt ggt gcg aag atg gac aca get tec atg etc gac gaa 3241 
He Val Pro Gly Gly Ala Lys Met Asp Thr Ala Ser Met Leu Asp Glu 
145 150 155 

gee ata cgt tac ace aag ttc ttg aaa egg cag gtg agg att ctt cag 3289 
Ala He Arg Tyr Thr Lys Phe Leu Lys Arg Gin Val Arg He Leu Gin 
160 165 170 175 

cct cac tct cag att gga get cct atg get aac ccc tct tac ctt tgt 3337 
Pro His Ser Gin He Gly Ala Pro Met Ala Asn Pro Ser Tyr Leu Cys 

180 185 190 

tat tac cac aac tec caa ccc tga tgaactacac agaagctege tagctagaca 3391 
Tyr Tyr His Asn Ser Gin Pro 
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tttggtgtca tcctctcaac ctttttcatg ttgatatatt atatatagat gcataaagat 3451 



3 



tcgatccaag attgtatggg -tgttttaata ttattattct aagatatatg atgtacaatt 3511 
gtgtaccaag tttctttatc ttgatatcat atgcataaat aattggtgaa taaaaagaag 3571 



atattgattg taaacaaaaa aaagaagata ttgattgtta attagggttt gatcattctg 3631 
tatgaaagct ttggcctgca aattaatttt cgatatatat atatatatat ggagaatata 3691 
tatcaaatac ttttttaatt tgactataat ttgtatcaat tatctgaatc tgatgagtgt 3751 
aggttatata tggattagca aaaaagaaaa caaccattat tacgcaccta cattaaaaat 3 811 
catccaccaa agaagaaacc atcctcaaga gggttccctc tagag 3 856 



<210> 2 
<211> 198 
<212> PRT 

<213> Arabidopsis thaliana 
<220> 

<223> INDEH I S CENT 1 (IND1) protein 
<400> 2 



Met 


Glu 


Asn 


Gly 


Met 


Tyr 


Lys 


Lys 


Lys 


Gly Val 


Cys 


Asp 


Ser 


Cys 


Val 


1 








5 










10 










15 




Ser 


Ser 


Lys 


Ser 


Arg 


Ser 


Asn 


His 


Ser 


Pro 


Lys 


Arg 


Ser 


Met 


Met 


Glu 








20 










25 










30 






Pro 


Gin 


Pro 


His 


His 


Leu 


Leu 


Met 


Asp 


Trp 


Asn 


Lys 


Ala 


Asn 


Asp 


Leu 






35 










40 










45 








Leu 


Thr 


Gin 




His 


Ala 


Ala 


Phe 


Leu 


Asn 


Asp 


Pro 


His 


His 


Leu 


Met 




50 










55 










60 










Leu 


Asp 


Pro 


Pro 


Pro 


Glu 


Thr 


Leu 


He 


His 


Leu 


Asp 


Glu 


Asp 


Glu 


Glu 


65 










70 










75 










80 


Tyr 


Asp 


Glu 


Asp 


Met 


Asp 


Ala 


Met 


Lys 


Glu 


Met 


Gin 


Tyr 


Met 


He 


Ala 










85 










90 










95 




Val 


Met 


Gin 


Pro 


Val 


Asp 


He 


Asp 


Pro 


Ala 


Thr 


Val 


Pro 


Lys 


Pro 


Asn 








100 










105 










110 






Arg 


Arg 


Asn 


Val 


Arg 


lie 


Ser 


Asp 


Asp 


Pro 


Gin 


Thr 


Val 


Val 


Ala 


Arg 






115 










120 










125 








Arg 


Arg 


Arg 


Glu 


Arg 


lie 


Ser 


Glu 


Lys 


He 


Arg 


He 


Leu 


Lys 


Arg 


He 




130 










135 










140 










Val 


Pro 


Gly 


Gly 


Ala 


Lys 


Met 


Asp 


Thr 


Ala 


Ser 


Met 


Leu 


Asp 


Glu 


Ala 


145 










150 










155 










160 


lie 


Arg 


Tyr 


Thr 


Lys 


Phe 


Leu 


Lys 


Arg 


Gin 


Val 


Arg 


He 


Leu 


Gin 


Pro 










165 










170 










175 




His 


Ser 


Gin 


lie 


Gly 


Ala 


Pro 


Met 


Ala 


Asn 


Pro 


Ser 


Tyr 


Leu 


Cys 


Tyr 








180 










185 










190 






Tyr 


His 


Asn 


Ser 


Gin 


Pro 
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<210> 3 
<211> 2765 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<223> 5* untranslated region IND1 promoter 



<400> 3 



ctctagacca tctactatcc ggttgttgac 
tgcaaatacc atatgtccat atccatcctt 
tgtaaaagaa aaaaccttat gttaatatca 
aaatttttat tataagatta tacattctct 
tgtattaact aataaaaagg acgaaagtag 
actggtatat tgttgatgaa tataacaagt 
tgtcttgaac tcattaaagg caatgtgaaa 
atcttctagc tttgtcgatt ttaaatctat 
tggacaacga catttagacg gtatcttaat 
atataattag tggtgattat aagtttactt 
accttaatgt cgaaagaagc cgtccccact 
gttaaatcgt caaataaaac aacttaagaa 
aacgttgaag aagtcattta tcgtatccag 
gccgttgaaa gggcttgtaa gaaaatgtcg 
gaaatagaag aacaatgatg ttaaattgcc 
gagagagaga agaaaattag gtcaaaataa 
taactattac aagagcgact tatgcttcct 
cgctcctttt ttgccctaat ttcgtaaagt 
ctagaaattt aatcccttgt tcttgatctt 
cactatttct tctatctttt gtggtctata 
gctccgtacg tacgtatacg aatttaaatg 
aatgagtcca tgaaatttgt tagcgaaaaa 
atatatataa ttttgctgca aaacttagct 
tttcctttgc caaattaatc acatgcacgg 
cattcacaca acactaaact taattagaac 
tcgaaaaaaa tttagttatg aaaacaattc 
ataaaatgaa gtaatattga tatgtatgta 
atatctgatt aaattgttca tgcaggccca 
tcacaataac taatatatat tcaaaaaaat 
tcagctttct ccaacctact aaactagttt 
ttgatttctt atgtttttta tttatggaat 
cttttcttgt tttctaaagt tacttttgta 
catatagtta tcttttaata tatgtctagt 
agttctacta atgtttgtgt aaaaaatctg 
gtataatgtg ggttaaattt agtaattttt 
tattggggaa tcagtatatt agcttgggta 
actatttgca aaattataat ttagtcccga 
taagcaggtt tgaatcttgg aaaattttgt 
ggttgggttc aaaagccttt tcaaaagttc 
ggaaataagt tgagtcataa aaaataatgg 
acgcacatat taagaagtta gttttttttc 
gcaaccattc ataaaacaat actattaaag 
cttaaaattt acatataaat ttacaaaaca 
aaaatatatc ataaaatgag acatatatcc 
ataataagag acatgcatgt aagcattcgg 
taaatacata tgtgtatatt tctggaaaaa 
gaaaa 



ccttaaagct tttgaagact actagaataa 60 
ttcttttgtt tgaactgaac attctaattt 120 
ccgtaggcaa aaaaaatatc tcatcatatt 180 
cgttgtaaga gttactccaa ttgcaagtgt 240 
gaagcttata attaattgat gttgcatagt 300 
atgaacatta atgcatgaaa cggggtattt 3 60 
agaagatgtg aggtctcatt ttgaaaattt 420 
gaaatgaacg caacatatag aaatttcatg 480 
tagaccgatt aattagtaat atacttatat 540 
atccacttga gaatttaaac aatgggcaat 600 
tcgtgtaatg agttatgggg gagagatcct 660 
ctagaaattg acaccaaaaa tcataaagag 720 
ctcatatttc ctagctaaga tcaaatcaag 780 
aagaaaccgt ggggtttaga agaaagacaa 840 
tattttggtg tataggagtt gtcaaaagag 900 
tgagcactaa aaatggagac atgtgttgag 960 
tatggcaatg atatccaaac caaagtgcaa 1020 
ctctctcctt cttcgtcctt aggaaaaacc 1080 
gctttttgag taaccatgat tttgaccaca 1140 
ggattttgct ttatatgtgt ttcttgtatt 1200 
gttataacaa ggtttatata aactagcaca 1260 
ggtagaaata tattgagtct ttaaacggca 1320 
ttaatcatga tctaatgata ttttctttaa 1380 
atttttggca agttatgtgt cgaattcttc 1440 
tctaggaaat attttaaaat gacaacttta 1500 
cagaattaaa catgagctat ataatttaag 1560 
ataacatatc tgattgcggt aaaaaaaaac 162 0 
tgtcactatg atgtcatcac gtttttattt 1680 
agttttgtca gattaaattt tttttggtgg 1740 
ggaatgttct cttctttatt tttctttttc 1800 
tttaagacgg attgtttagg tcgtttctct 1860 
aactcatctc ctcccaatta gacagtcaat 1920 
tgataaaaaa aatgaaaaaa tactggtggt 198 0 
atattatgaa tctaatcaat ttctttgatc 2040 
tacataaata agaactgtaa tgttgatgta 2100 
actatacttc tggaaatact tgaagattta 2160 
aaaatacaga cgacgggaca cgacaacata 2220 
atacataacc tatataaata ctaatgttct 2280 
cattttttaa attcaaggac attttacata 2340 
ttattttgta aggttttttt tttgattaaa 2400 
actaccaaat atcaattaat ttaaaaccat 2460 
aatataaata atcacaaaat attaaataca 2520 
tctaattaat tgaaacagaa aggaaaaggt 2 580 
tataaaaaaa aaatgaggca tatgaagtaa 2 64 0 
ttaattaatc gagtcaaaga tatatatcag 2700 
gaatatatat attgagaaat aagaaaagat 2760 

2765 



<210> 4 
<211> 496 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<223> 3' untranslated region IND1 promoter 
<400> 4 

atgaactaca cagaagctcg ctagctagac atttggtgtc atcctctcaa cctttttcat 60 
gttgatatat tatatataga tgcataaaga ttcgatccaa gattgtatgg gtgttttaat 120 
attattattc taagatatat gatgtacaat tgtgtaccaa gtttctttat cttgatatca 180 



tatgcataaa taattggtga ataaaaagaa gatattgatt gtaaacaaaa aaaagaagat 240 
attgattgtt aattagggtt 'tgatcattct gtatgaaagc tttggcctgc aaattaattt 300 
tcgatatata tatatatata tggagaatat atatcaaata cttttttaat ttgactataa 360 
tttgtatcaa ttatctgaat ctgatgagtg taggttatat atggattagc aaaaaagaaa 42 0 
acaaccatta ttacgcacct acattaaaaa tcatccacca aagaagaaac catcctcaag 480 
agggttccct ctagag 4 96 



<210> 5 
<211> 5622 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<223> SHATTERPROOF1 (SHP1) genomic 
<220> 

<221> modif ied_base 

<222> (935) . . (941) 

<223> n = any nucleotide 



<400> 5 

agate tgcaa cagtgaaaag agaaaacaaa 
agataatget tattccctaa tatgttgcca 
gatttctgta tcagtggtca tatttgtgga 
ttgctttcag tgcaattcta attcacacgt 
cttcttcact aagacacaat acggcaaacc 
ataattaacc tcataatcaa gattcaatgt 
gaagattaga tacgtatatt agtagattta 
ctcatagata gtaatatcat tgtcaaattc 
tttaagecta aaagtagtgt attegcatat 
acaaaaaaat cgaaaagaaa ataatctgtt 
gtttctattt aaaaaatget tgaaatttga 
cgatgtctag atactcagaa tttaatcaag 
acccacgcaa aagaaaagac taatatgatt 
tgaaactaag agagtgaaag attcgaaatc 
gaaaaatgaa gatgaaggac taaagaacaa 
aggattcagg tactaggaga agttgaggca 
tatttttaca atttaattca cctaagttat 
gtatagcgta tttacatata tgaaataaat 
tgaatgtccc caegcaaaaa aaaatccctc 
attccaaaat taccataaac ttttggttta 
cctaaatgat tcatgaatcc ttcaataatc 
agagttgega tatttgtaat ttggtgacct 
tttcttaaaa gtaaaacttg attttatgtt 
ttcttgcaaa aagagttcgt taactgeatt 
atcttcatga acattaagaa aaacaatatt 
cttgattcac atacccattc tagactttgg 
tggtaatgea gaagggtaaa aaaaggaaga 
aggagtttca aaagatggtt ctgatgagaa 
ccatgaaaga aatcggatgg tcctccttca 
ttttttcttt cttattatta accatttaat 
ctgtaaaaag aaaatacaca tctcacttat 
agagaataaa aaagtgtgag tttctaggta 
agatgatctg tccatttttt tcttttttct 
aaaactaata accttctgtt ttcagcaact 
ttcaattgaa aacccataaa ccaaaataga 
gaacaacata attagaaaag cagaagcagc 
tttctcttca agaacagttt ctcattaccc 
cttgagaact caggtaaggt tgtgaatatt 
gatttaaaat aaatgtttct tctttctctg 



atggacttga agaggttttg acaatgecag 60 
gccaagtgtc aaattggctt tttaaatatg 120 
tecaaegtat tcatcatcaa gttctcaagt 180 
ttaactttaa catgcatgtc attataatta 240 
tttcagatta tattaatctc cataaatgaa 300 
ttctaaatat atatggacaa aatttacacg 3 60 
gtctttcgtt tgtgcgataa gattaaccac 420 
ctctcggttt agtcgctaaa ttgtatcttt 480 
gaettategt cctaactttt tttttaatta 540 
aaatattttt taagtactcc attaagttta 600 
cagttatgtt caacaatttt gaatcatgag 660 
atgtcttatc aaatttgttg tcactcgagg 720 
tttatttggt ctggatattt ttgtagagga 780 
cacaatgttc aagagagctc aaagcaaaaa 84 0 
taagcaacta cttataccct atttccataa 900 
agttnnnnnn nattgattca aattttcatt 960 
tatgeattte tcatcattgg tacattttct 1020 
taaatatgtc ctcacgttgc aagtagttaa 1080 
caaatatgtc caccttttct tttcttttta 1140 
caaaagattt ctagaaattg aggaagatat 12 00 
tgaagtttgc gatattttcg attttcttca 1260 
taaacttttt ttgataaaga gtaaacgttt 1320 
ttagggttct agctcaactt tgtattatat 1380 
cttcaacact ataaagtgat tatcaaaaac 1440 
tggtttcggt tagagcttgg ttttgcttgg 1500 
cataaatttg atacgataga gagtatctaa 1560 
gagaaaaggt gagaaagatt accaaaaata 162 0 
acagagccca tccctctcct tttccccttc 1680 
atgtcctcca cctactcttc tcttctttct 1740 
taatttcccc ttcaatttca gtttctagtt 1800 
agatatccat atctatttat atgcatgtat 1860 
tgttgagtat gtgctgtttg gacaattgtt 1920 
tctgtgtata aatatatttg agcacaaaga 1980 
agggtcttat aaccttcaaa gaaatattcc 2040 
tattacaaaa ggaaagagag atattttcaa 2100 
agttaagtgg tactgagata aatgatatag 2160 
accttctcct ttttgetgat etategtaat 2220 
atgeaccatt cattaaccct aaaaataaga 2280 
attcttgtgt aaccaattca tgggtttgat 2340 



atgtttcttg gttattgctt atcaacaaag agatttgatc attataaagt agattaataa 2400 
ctcttaaaca cacaaagttt ^ctttattttt tagttacatc cctaattcta gaccagaaca 2460 
tggatttgat ctatttcttg gttatgtatc ttgatcagga aaagggattt gatcatcaag 2520 
attagccttc tctctctctc tctagatatc tttcttgaat ttagaaatct ttatttaatt 2580 
atttggtgat gtcatatatg gatcaatgga ggaaggtggg agtagtcacg acgcagagag 2 64 0 
tagcaagaaa ctagggagag ggaaaataga gataaagagg atagagaaca caacaaatcg 2700 
tcaagttact ttctgcaaac gacgcaatgg tcttctcaag aaagcttatg aactctctgt 2760 
cttgtgtgat gccgaagttg ccctcgtcat cttctccact cgtggccgtc tctatgagta 2 82 0 
cgccaacaac aggtacgctt ctcctactct atttcttgat cttgttttct taattttaac 2880 
taaacaagat cctagttcaa atgataacaa agtggggatt gagagccaag attagggttt 2 94 0 
ggttaattta gaaaaccaga tttcacttgt tgatacattt aatatctctc tagctagatt 3000 
tagtactctc tcctctatat atgtgtgggt gtgtgtgtaa gtgtgtatat gtatgcaaat 3060 
gcaagaagaa gaagaaaaag ttatcttgtc ttctcaaatt ctgatcagct ttgaccttag 3120 
tttcactctt ttttctgcaa atcatttgaa cctgatgcat gtcagtttct acaatacact 3180 
tttaattttg acggcccatc aaatttccta gggtttactt cagtgaacaa aattgggttc 3240 
ttgacacgat ttagcatgta tatataaaaa taggggatga tcaagactta tgtaacctct 3300 
gtctggtgaa actagggaca aagtctactg atgagttgtc actagggatc catttgatca 3360 
tttaatccca acaaaaatga aacaaaattt tgagaattta tatgctgaag tttttcaacc 3420 
ctctttttta aataacttta tattatgtag atttgtattt agggtaattt gtccaactag 3480 
aagtcctaaa aatcaataaa cacacggatg actttgtcta acattgtatc agtcatcaaa 3540 
tgtaaaattg tacaaataat gaaattaaag atttagtctc ttttattttt tttgtttagg 3600 
gtgtatatat atatatatat gtatatttgt tgcattgata tatcaatgag agggagagaa 3660 
ctcagagaag tgtcggaaat taaaatggta cgagccaatt ggaatctctg gcattctgag 3 72 0 
cttcatttgt ttgttattag aaaaaaaaaa aaaaaatcct ttaaagatac cttcatgatg 3780 
acattgaatc atgtaatata cacgatacat ggtctaattc ctcctcaaac cctaattacc 3840 
aatttcgaaa ccataatatt tactagtatg tttatatatc cttactttaa gacattgttt 3900 
gtttataata ccttgtgaat taagaaaaaa aaaaaaaaac ttgtggatct attcaagcca 3 960 
tgtgttagaa taaatttata aattttctcc tcgtactggt cagatattgg tccaaactcc 4020 
aaagccttcc cttttcagga aaaaaaacat ttcgaaatta actctaatta atcaagaatt 4080 
tcctacaatg tatacatcta atgttttttc cgcgatctta cttattagtg tgaggggtac 4140 
aattgaaagg tacaagaaag cttgttccga tgccgtcaac cctccttccg tcaccgaagc 4200 
taatactcag gtaccaattt atattgtttg attctctttg ttttatcttc ttcttttcat 4260 
tatatatatg atcaacaaaa aatataacct acaaaaagag agagttcaag gaaatgcatt 4320 
gaaacggttt cgttatggtg tttgaataca tggatttttg aagtactatc agcaagaagc 4380 
ctctaagctt cggaggcaga ttcgagatat tcagaattca aataggtaat tcattaactt 4440 
ttcatgaact cttcgatttg gtattaggtc acttaatttg gtgtcggtcc aaaagtccgc 4500 
ttgtagtttt ctttagaagt tgttttgttt aatgttcatg tttacaaatt gaaggcatat 4560 
tgttggggaa tcacttggtt ccttgaactt caaggaactc aaaaacctag aaggacgtct 4620 
tgaaaaagga atcagccgtg tccgctccaa aaaggtaaaa tctacgttgc tctctctctg 4 68 0 
tgtctctgtc tctctctcta tatatagtcc cttagtttat atagttcatc acccttttgt 4740 
gagaattttg cagaatgagc tgttagtggc agagatagag tatatgcaga agagggtaag 4 800 
aacgtttctc ccattccaag taattagatc tttcttcgtc tttgtgaggg tttgagtttt 4860 
cccataaatc atgtgtagga aatggagttg caacacaata acatgtacct gcgagcaaag 4920 
gttagccacg ttctgttcca aatcttaatc tcaatatcta ctcttttctt cattgtataa 4980 
ctaagataac gtgaataaca agaaaacttt tgtttttggg tttaatagat agccgaaggc 5040 
gccagattga atccggacca gcaggaatcg agtgtgatac aagggacgac agtttacgaa 5100 
tccggtgtat cttctcatga ccagtcgcag cattataatc ggaactatat tccggtgaac 5160 
cttcttgaac cgaatcagca attctccggc caagaccaac ctcctcttca acttgtgtaa 5220 
ctcaaaacat gataacttgt ttcttcccct cataacgatt aagagagaga cgagagagtt 5280 
cattttatat ttataacgcg actgtgtatt catagtttag gttctaataa tgataataac 5340 
aaaactgttg tttctttgct taattacatc aacatttaaa tccaaagttc taaaacacgt 5400 
cgagatccaa agtttgtcat acaagattag acgcatacac gatcagttaa tagattttaa 5460 
gtgcctttta atatttacat atagttgcag cttcgattag atcatgtcca ccaaacactc 5520 
acaattagag acaagcaaaa ctataaacat tgatcataaa atgattacaa catgtccata 5580 
aattaattat ggattacaaa aataaaaact tacaaaagat ct 5622 



<210> 6 
<211> 6138 
<212> DNA 

<213> Arabidopsis thaliana 
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<220> 

<223> SHATTERPR0QF2 (SHP2) genomic 



<400> 6 

gaattcgtaa cagaatttag tgaataatat 
gatagctcga atatcgttat taaagagtaa 
tctaacattg ttggactctc tattgctcga 
aactatcaag ttgcatttgt aaaaaccaaa 
gaggatagag aagcatgact ccaggcctgc 
taaaaatttg acaaatatag tgtttatttt 
ctacccaact agagcatcaa aacgttttgc 
gagtttttac tcttctttca acagaaactt 
atatatacac aaaagcttga aatttcttca 
acaagccagg taaagaatct ccaacgcatt 
gttctaagaa agaaaaaaat atctcaaagt 
tccaaagtct tgggtttagt atccctatga 
aaaagattct tttctttttc acaagagtta 
gattggacgt atacacaatt tgattatgat 
tggaaatctc tctttcttac ccatttatat 
attatcaatc atcgttcaat ttgataaaga 
agcaatcttt acataattca tctttcttac 
ttctatttga ttatgattat tttgtcataa 
ttatattact tcacgctaat taactcttaa 
caaagcatat actatatatt gagaatatag 
ctctcatcaa gttctttctt gaaatgattt 
tgaatgtgaa aatgtgtgtt gctcgttaag 
gaggaatggg caaaacacaa aagagtttcc 
tgagaaaggg ttcatggata atgactacac 
gctttcattt actttattag tttcatcttc 
ttcttttgtc taaattacgt aatatgatgt 
ccgttggact aacctaatgt ccaagattaa 
aaccagttat aatatatact cttaaattgc 
tactatttac agtttttttt aattggcaag 
ggtagttgac aagttataac actcatattc 
cactcatata aaccacccaa attcttagca 
atttttttta tatataaaat gccaggtaaa 
gggcaataaa aagaaaagga gaataaaaag 
gtgattctga tgagaaacag agcccatacc 
attggatggt cctccttcaa tgctctctcc 
ctttcttttt tcttctttct aatttgatat 
atttcaatct aaatgtatgc atatagaatt 
agaaaatgtt agaagtaatg gtccatgttc 
gtttgaaaaa aaactaccaa accttctgtt 
agaaatattc ctctaaaaga aattataaac 
ttgaagcagc agttaagtgg tactgagata 
aattaactct ctcataattc atcttcccat 
atcttgctaa gccaggtatg gttattgatg 
tttctccaat caaattcttc agttaatcct 
gagtgcaaaa ggatttgatc tatttctctt 
atggagggtg gtgcgagtaa tgaagtagca 
atagagataa agaggataga gaacactacg 
aatggtttac tcaagaaagc ttatgagctc 
gtcatcttct ccactcgagg ccgtctctac 
agctagatct tgattttgtt gaattttttt 
gtttttcaat ctttatggtc tctttatagt 
ctctttaaaa atccccaaaa ttagggtttg 
tttctcttag taaaaaagca gtgagggtct 
cttgttccag tcactttttc tccacaaatc 
tatgtcagtt tgtacaagac acttcgatca 
agattgatca ttaggtttcc atcatagtaa 



tgtaattacc aggcaaggac tctccaaacg 60 
atgatccaat atgtaagcca ttgttgatca 12 0 
aatgatgcat acctaatcat ttattcagtt 180 
catttaaatt cagatttgat atcacttaca 240 
atgcaacaag aaaaaggaag aaaataatgt 3 00 
tattatatga gacagaattt gaataaaatc 360 
aatcgcaata atgaaaccca ttttcttttt 420 
tctcaaacgt ctttagcact gtgacgttag 4 80 
agcaaaagaa tctttgtggg agttaaggca 540 
gttacgtttt catgaaccta tttattatat 600 
aaacgttgga aattttctga tgaagggaaa 660 
atggtatttg gaatatgttt tcgtcaaaac 720 
gtgatcaata acttatgcac taattaatga 780 
acttgagtaa aaatcacctg tcctttaatt 840 
actacttctt ttcattaaaa ttaaatttca 900 
tttaacattt tttgtcacag ggctagtaaa 960 
atatatatat tacctttttc ttcattagta 102 0 
agctagtaaa ttaaacactc gatatgagaa 1080 
cacaacaaga actagtgcat attcaacttt 114 0 
accacgaaag tcaatcaaaa gacctaccag 1200 
tgcagaattt ccaacttaat taattcgaca 12 60 
aaaattgaat agaagtacaa tgaaaatgat 132 0 
tttcgtaact acaattaatt aatgcaaatc 1380 
acatgattag tcattccccg tgggctctct 1440 
tctaattata ttgtcgcata tatgatgcag 1500 
aattaattat caaaataaat attcaaattg 1560 
gacttgaaca taagaatttt ggaaaaacta 1620 
catttctgaa cacaaccaaa taataatata 1680 
aacactgaaa tcttattcat tgtctcgctt 1740 
atataacccc attctaacgt tgacgacgaa 1800 
tattagctaa atattggttt aattggaaat 1860 
tattaacgac atgcaatgta tataggagta 192 0 
ggattaccaa aaaaggaaag tttccaaaag 1980 
tctctttttt cctctaaaca tgaaagaaaa 2040 
ccacccaatc caaacccaac tgtcttcttt 2100 
tttctaccac ttaattccaa tcaatttcaa 2160 
taattaaaag aattaggtgt gtgatatttg 2220 
tttctttctt tttccttcta taacacttca 2280 
ttctgcaaat gggtttttaa atacttccaa 2340 
caaaacagaa accaaaaaca aaaaataaag 24 00 
ataagaatag tatctttagg ccaatgaaca 24 60 
cctcacttct ctttctttct gatataatta 2520 
atttacactt ttttttaaaa gtttcttcct 2580 
tataaaccat ttctttaatc caaggtgttt 2640 
gtgtttatac ttcagctagg gcttatagaa 2700 
gagagcagca agaagatagg gagagggaag 2760 
aatcgtcaag tcactttctg caaacgacgc 2820 
tctgtcttgt gtgacgctga ggttgctctt 2880 
gagtacgcca acaacaggta cacatctttt 2940 
tctagaataa agtttcgact cttctggtgg 3000 
ttttttcctt agtttctctg aagctcaaat 3060 
tttaaaacta gggaacccta ctttaacttc 3120 
tctctgatca ttaattagca tcccccatac 3180 
cttataacag tatctatata tgtatctatt 3240 
atttgatgac ccatcaagtt ttatttctgc 3300 
tgaaaaagta gggttcttga taaaattata 3360 
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* 



ataatatata ttatttggct atataaaaaa 
tagggagaga ctagtaggtg -tttgtcttct 
ttgttaaatc aagaaaatga atcagggaca 
tccgatccat caatcaagaa aaataacgaa 
tctccacttc aagttgggga cttgtcattt 
agctcatgtt catttcacca ttttgattat 
aaaatattgt ttttatttgc ttggctagtt 
taatctataa gtcaagtctc atatcatgga 
caatgtgaac tttcacaacg actaaagaac 
tttgattaac aaaagaggaa tgcattatgt 
tattatcaaa agtgtagtta actttcttat 
tacttttgac atttctcaac ttcagctttc 
gtatatgaat atgaaagcat aacgttatgc 
gcaaaagatt tacaaatatc atgttcttgg 
atgtaaatgt aatgttcaaa tattaattca 
ctagagtagg gaatctattg acttgacctg 
gtgaattttt taatctaatt aggccaaaat 
tgtgtcgtac ataaattggg accaattcca 
tttttatttt cttctccgat ttgctacttc 
cacttttttg gggaagtaca cattagggtc 
atatatatat atataacttt gtgagatgtc 
atatccaaaa aagaaggcgc aaacaaatca 
tttgtcggtt gaatttaagg tttggcgtac 
ctttctgtgt gttttgtata tccagaagat 
tatatatata tatatatata tatatatatt 
acagttatag tttcgtgtgt ctttgtttta 
cgattgcatc tatttacata tatagctacc 
ctttgtgtga atgttgatga agtgtgagag 
ccgacgccgt taaccctccg accatcaccg 
tacacctagc tagctagttc gttaattact 
ttttttcacc tcttgtaaca atgatgggat 
cgtctaaact ccggagacag attcgggaca 
aatctcttgg ttccttgaac tttaaggaac 
gaatcagtcg tgtccgatcc aagaaggtac 
ttgaatatat atccatctga ttcttgcccg 
tgttagttgc agagattgaa tacatgcaaa 
acaatgaact acccctactt tattagcaac 
tctgttgtcg cttgcattgt aggaaatcga 
caaggtttta tacataactc tttttggcat 
tctcttgatg tgcaaattct aaatatctct 
acaagaatcg agtgtgatac atcaagggac 
ccagtcgggg cagtataacc ggaattatat 
ttcctccaac caagaccaac cacctctgca 
tttcctcagc ctgagatcga tctatagtgt 
tagtcgcaag ctgaggggaa ccactagtgt 
acgggtacat tgttgatgtc atgtatgaca 
gaacgtacgt ccgaattc 



gctatgtaga ttccttaaaa attgattcac 3420 
gacacttctc taatcttttg gtgaatcctt 3480 
aagcttattg ttgagtcact taattaatca 3540 
acagaaaatt ttgatttttg attgttattt 3600 
ccgtttttct atacgtttcc agctattaac 3660 
ttgtctgctt tttaaagata aatgttttca 3720 
aatactataa ttgaggttga tgtatgacta 3780 
tctaagttaa aactagtaaa tttgtagttt 3840 
tgatctgaag tttataatgg acatgactaa 3 900 
atgtagaaac atgtgatata tatatgtttc 3960 
ttcaaacacc ctcatgcttt agtagtatct 4020 
cattatacaa cagcacaatg taaattactt 4080 
aaagatttct agcttttctt tttctgtttt 4140 
taaaaacata cttgcctcag ccacatatgc 4200 
ggaaaaacaa agaagaagca aaattagctt 42 60 
aaaatcactt ctttttctta aagcctagta 4320 
atatactagc ctaaaatata atttggattt 4380 
attaactaag agcatatgca attcaaattc 4440 
tttcttttgt atgttttcaa attaggatta 4500 
ttctcgaact ttgattatac atatatatat 4560 
actgttaata gataataggc aataacaata 4620 
tatactatat ggtactggtc cattcactat 4680 
aaactttgtt tcaaaccttt attattccgt 4740 
aaaaatatca atttctttaa cgacttcata 4800 
tttctcttct ggttttagtg tttgaatcca 4860 
cttgtggtgg tttaagtttg agattttcac 4 920 
acaaaaaaga ttgcatttta aaatcttttc 4980 
gaacaataga aaggtacaag aaagcttgct 5 04 0 
aagctaatac tcaggttagc ttttaattaa 5100 
taatttcttc ttcttttagt tatctgacct 5160 
cgaaattgat gaagtactat cagcaagagg 522 0 
ttcagaattt gaacagacac attcttggtg 5280 
tcaagaacct tgaaagtagg cttgagaaag 534 0 
atcactaact ctccatcaat ctccttatca 5400 
ttatatttgg tttttctctc cagcacgaga 5460 
aaagggtaaa agtaaaacct atcttccttc 5520 
ttctctttct gatgatcatc ttttttattt 5580 
gctgcaaaac gataacatgt atctccgctc 5640 
ttttgatcat catttttttc cggtagacaa 5700 
gcagattact gaaagaacag gtctacagca 5760 
agtttacgag tcgggtgtta cttcttctca 5820 
tgcggttaac cttcttgaac cgaatcagaa 5880 
acttgtttga ttcagtctaa cataagcttc 5940 
cacctaaatg cggccgcgtc cctcaacatc 6000 
catacgaacc tccaagagac ggttacacaa 6060 
atcgcccaag taagtatcca gctgtgttca 6120 

6138 



<210> 7 
<211> 51 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : IND1 basic 
helix -loop -helix (bHLH) DNA binding domain 

<400> 7 

lie Ser Asp Asp Pro Gin Thr Val Val Ala Arg Arg Arg Arg Glu Arg 
15 10 15 
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lie Ser Glu Lys lie Arg lie Leu Lys Arg lie Val Pro Gly Gly Ala 

20 » 25 30 

Lys Met Asp Thr A/La Ser Met Leu Asp Glu Ala lie Arg Tyr Thr Lys 

35 40 45 

Phe Leu Lys 
50 



<210> 8 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer for 
amplification of IND1 genomic region of cDNA 

<400> 8 

gatgaaaatg gaaaatggta tgtata 2 6 



<210> 9 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer for 
amplification of IND1 genomic region of cDNA 

<400> 9 

gttcatcagg gttgggagtt gtg 23 
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